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Application Papers 
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Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
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DETAILED ACTION 

1 . This action is responsive to communications: Amendment filed 2/9/05. 

2. Claims 1-20 are pending. Claims 1,10, and 12 are independent claims. 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. Claims 1-20 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Gong . US 6,751,776 B1, 6/15/04 (filed 8/6/99) in view of August et aL US 
2003/0028378 A1 , 2/6/03 (filed 9/9/99). 

In reference to claims 1, 10, and 12, Gong teaches a method and apparatus for 
personalized multimedia summarization based upon a user specified theme. See 
abstract. Compare to "a method for generating summaries of a video. . .". Gong's 
teaches retrieving multimedia imagery relevant to user-supplied keywords and topics. 
The multimedia imagery is segmented into multimedia scene shots. See column 6, 
lines 24-34. Gong further teaches requiring a user to select a theme of multimedia 
summarization and extracting multimedia features from each multimedia scene shot 
using natural language processing and video analysis technique. The multimedia 
scenes comprise frames. See column 6, lines 24-52. Compare to "inputting 
summary sentences, visual information, and a section-begin frame and a section- 
end frame for each story in a video; selecting a type of presentation; locating a 
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set of images available for each story". Gong teaches that the summaries include 
video, audiQ, and textual information; however, he does not explicitly state that the 
summary sentences (keyword or topics) are audited to generate an auditory narration of 
each story; however, August does. August teaches a method of digitally converting text 
to audible speech and providing the speech to an interactive language instruction 
system. August discloses digitally converting text to audible speech to a user or 
student. The audible speech is used in a language lesson where the lesson includes 
video, photographs, other media, and linking the sound files created from the text-audio 
conversion with the images and content of the lesson. See page 1 , paragraphs [0001]- 
[0004], Compare to "auditing the summary sentences to generate a plurality of 
summary audio segments corresponding to an auditory narration of each story; 
composing the set of images to selectively match the set of images with the 
summary sentences to generate a plurality of summary image segments; 
matching said summary audio segments with the summary image sentences to 
generate a plurality of summary image segments to generate a story summary 
video for each story in the video; and combining each of the generated story 
summaries to generate a summary of the video". It would have been obvious to a 
person of ordinary skill in the art at the time of the invention to incorporate August's text- 
audio conversion and linking with video and other media in a system such as Gong's as 
it js desirable to synchronize the various components of a multimedia system in order to 
supply the user with a thorough multimedia summary including video and audio that are 
in sync. See abstract of Gong in which he discusses the need for a unified platform of 



Application/Control Number: 09/908,930 Page 4 

Art Unit: 2176 

video. Furthermore, Gong suggests the synchronization of audio, video, and text, thus 
it would have been desirable to have a means to provide audio for the user-supplied 
themes relating to the video. 

In reference to claim 2 and 13, Gong teaches that the visual information is 
comprised of video shots and frames. 

In reference to claim 3, 1 1 , and 14, Gong teaches retrieving multimedia imagery 
relevant to the user-specified keywords and topics. The multimedia features are 
extracted according to the theme selected by the user. The multimedia features include 
scene shots which have a certain length. See column 6. The user selected theme and 
individual multimedia features are then obtained and relevance measurements are 
determined to see which scene should be included in the multimedia summarization. 
See column 6. The summarization includes keywords/topics extracted from the 
imagery, video, and audio. Compare to "generating story boundaries and sentence 
data using story extractor; selecting a length of a story summary; summarizing 
said sentence data to produce at least one summary sentence, wherein a number 
of the summary sentences produced corresponds to the length of the story 
summary; and ordering the at least one summary sentence based on its selection 
order". 

In reference to claims 4, 5,1 5, and 16, Gong discloses a multimedia 
summarization and presenting the scene shots to a user which could be in the form of 
media or computer-readable media which could include a slide format or poster. See 
column 12, lines 49-67. 
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In reference to claim 6 and 17, Gong teaches identifying features from each 
multimedia scene shot which includes a number of frames including a beginning and an 
end. See column 6, lines 1-67. 

In reference to claim 7-9 and 18-20, Gong teaches retrieving multimedia imagery 
relevant to the user-specified keywords and topics. The multimedia features are 
extracted according to the theme selected by the user. The multimedia features include 
scene shots which have a certain length. See column 6. The user selected theme and 
individual multimedia features are then obtained and relevance measurements are 
determined to see which scene should be included in the multimedia summarization. 
See column 6. The method also includes forming a subset of individual scene shots 
that are considered relevant and removing duplicate scene shots. See column 15. 
Gong further teaches that a time length limit is allotted to a portion of each time slot thus 
a selected multimedia scene shot's time is proportional to its relevance. See column 
16, lines 35-51. 

Response to Arguments 

5. Applicant's arguments filed 2/9/05 have been fully considered but they are not 
persuasive. 

Applicant argues that Gong does not teach "composing the set of images. . .to 
generate a plurality of summary image segments". Examiner respectfully disagrees. 
Gong discloses retrieving multimedia imagery relevant to user-supplied keywords and 
topics and segmenting the multimedia imagery into multimedia scene shots. See 
column 6, lines 24-34. 
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Applicant argues Gong does not teach "auditing the summary sentences to 
generate a plurality of summary audio segments . . Examiner utilized August 
reference to address this claim limitation. Applicant argues that August does not teach 
any selection, separation, or synthesis of media materials to create summaries of such 
material or auditing the summary sentences to generate a plurality of summary audio 
segments. August teaches a method of digitally converting text to audible speech and 
providing the speech to an interactive language instruction system. August discloses 
digitally converting text to audible speech to a user or student. The audible speech is 
used in a language lesson where the lesson includes video, photographs, other media, 
and linking the sound files created from the text-audio conversion with the images and 
content of the lesson. See page 1, paragraphs [0001]-[0004]. It would have been 
obvious to a person of ordinary skill in the art at the time of the invention to incorporate 
August's text-audio conversion and linking with video and other media in a system such 
as Gong's as it is desirable to synchronize the various components of a multimedia 
system in order to supply the user with a thorough multimedia summary including video 
and audio that are in sync. See abstract of Gong in which he discusses the need for a 
unified platform of video. Furthermore, Gong suggests the synchronization of audio, 
video, and text, thus it would have been desirable to have a means to provide audio for 
the user-supplied themes relating to the video. 

In view of the comments and rejections above, Examiner's position is maintained. 

Conclusion 
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6. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 

§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

7. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Rachna Singh whose telephone number is 571-272- 
4099. Starting in mid-October, the examiner should be reached at 571 .272.4099. If 
attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Joseph Feild can be reached on 571-272-4090. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-21 7-91 97 (toll-free). 
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